SEQUENCE LISTING 



<110> Fritig, Bernard 
Toquin, Valerie 
Geoffroy, Pierrette 
Legrand, Michel 
Kauffmann, Serge 



<12 0> INDUCIBLE COMTII PROMOTER, CHIMERA GENE 
CONTAINING SANE AND TRANSFORMED PLANTS 



<130> A34638-PCT-USA-I (072667.0189) 

<14 0> To Be Assigned 
<141> 2003-08-04 

<150> US 09/937,204 
<151> 2001-12-13 

<150> PCT/FR00/00714 
<151> 2000-03-22 

<150> France 99/03700 
<151> 1999-03-22 

<150> France 99/07646 
<151> 1999-06-11 

<160> 26 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 1863 
'<212> DNA 

<213> Nicotiana tabacum 
<220> 

<221> misc_signal 
<222> (667) . . . (672) 
<22 3> Inverted W box 

<221> misc_RNA 
<222> (820) . . . (830) 
<22 3> Inverted L box 

<221> enhancer 
<222> (845) . . . (852) 

<221> misc_signal 
<222> (1034) . . . (1047) 
<223> P box 

<221> misc_signal 



<222> (1221) . . . (1226) 
<223> G box 

<221> misc_signal 
<222> (1343) . . . (1356) 
<223> Inverse L box 

<221> misc_signal 
<222> (1369) . . . (1374) 
<223> A box 

<221> misc_signal 
<222> (1377) . . . (1382) 
<223> GT box 

<221> misc_signal 
<222> (1483) . . . (1488) 
<22 3> GT box 

<221> misc_signal 
<222> (1562) . . . (1567) 
<22 3> Inverted W box 

<221> misc_signal 
<222> (1600) . . . (1614) 
<223> L box 

<221> CAAT_signal 
<222> (1675) . . . (1679) 

<221> misc_signal 
<222> (1681) . . . (1690) 
<223> E box 

<221> CAAT_signal 
<222> (1695) . . . (1699) 

<221> TATA_signal 
<222> (1735) . . . (1739) 

<221> misc_signal 

<222> (1772) ... (0) 

<223> Transcription origin 

<400> 1 

aaagttaggg acaatctata gtgtcacaaa gttgcttatg gcttttggtt cagataaaga 60 
aaaagaacag cattttaatt tgtgaagatt agtctgagca gaatttcatt gtatctagaa 12 0 
agaaattgaa aaaagaaata ttctatttca ctattatgtt aggtgcaact atatcatcac 180 
catggaaaag ccggagtaaa aagagaacgt agaggagatt tcatgatttg attgagaata 24 0 
taatatatta tttttttgta attccacaca aagattaaga aaatgatctg atcaatgatg 3 00 
gctccgagga tttggctgtc gcgggaacta tgacattaat ataaatttgt cgctgcctat 360 
aaagacccta tctatctatc tatctatcta tatatatata tatatatata tatatatata 420 
tatatatata tatatatata tatatatata tatatataag cgctaatatt tgattatttt 480 
ttaaaaatat ttataagtat atatgaaatt tttgacgaaa tttttgtgtg accgtgaccc 540 
ctcaacctat agtgtgcgtc cacctgtgcc aacaatatag agacaatttg ctcgtatagt 600 
cagaaagagt gttttacttt ttagttgctt tttagtgaat ctactcggta taaagttaaa 660 
ttagtgggtc aataagtcgg gtgaatagtt aaagaaaaca gtggtgagtt tagctgtcaa 72 0 



ataatttctt ctttttcttg ttttcacatt agaaatcaaa ataaaacaca agctttttgt 780 
atttatttta acacaagcta attatatgtt tatatgctgg ttaggtgaag taaagcatgt 840 
tatatgagga aagtacgaag aaaatgtgcc aattgtcgtg tacagcaaag cagccagcac 900 
aagcaaattc gcacttgata agtggctaag tccactttct agtggaccta gtggttcact 960 
aacttttacc aaaaaggcaa taatttgcaa ttcaaaaaga aaaaaggaaa aaagaaaact 102 0 
agacagactt taacacacca actcccacag gaagcaacaa tgcaactcac aaaaggaaac 1080 
cgagtttttc cgcgacggat ctagaatttg ggttcattct ttacgctttt tcgtattaaa 1140 
ctcattatat ttgtataatt atgggtttat attttttatt tattgtaatt tttgtaaaat 1200 
tttatatata agtgtatact ccacgtctcc ggatactaca ttagcctcta gggttcttaa 1260 
tactcttgtt aaattgtcca ggctccaaac gcatgttcgt ttcaatttta acggatgttt 1320 
ccgaacaact ccaaatgttc aatgttaggt gtgtttggtg ttaagcttcc gtcctaggtt 1380 
aatagaatag ataattgttg tttcttatat agttttgaac aatcgtcgcc ataaactaat 1440 
ttttaggatg gaagctaatt tttaggatgg agtacagcct aaggttaaaa tataactata 1500 
aaaaatatcc ataaaaggtg aaatttaatt agtaacatga aaagataaaa ctagtgttat 1560 
cggtcaaact ttcaaaagag aaagaaataa ctagacaaac ttcaacaacc aacctgccca 1620 
acatgctact gtgcaattga aaaataaaca aaagagaacc agacaatatt tcaaccaata 1680 
ttccatcaag aaaaccaatt atgacaattc ttaaccaaag tcacaactaa cacttataaa 1740 
aagcactaac tcaactgtac atgattgtga agcctaacaa aaacactcta aaaggaaaag 1800 
actacgagaa taattacact acaactctta tagctaattc ttgtctcaag attttcagct 1860 
atg " 1863 

<210> 2 
<211> .5371 
<212> DNA 

<213> Nicotiana tabacum > 
<220> 

<221> promoter \ 
<222> (1) .-. . (1860) 

<221> misc_signal 

<222> (1772) ... (0) 

<223> Transcription origin 

<221> intron 

<222> (2282) . . . (3633) 

<221> intron 

<222> (3945) . . . (4726) 

<221> terminator 
<222> (5090) . . . (5371) 

<400> 2 

aaagttaggg acaatctata gtgtcacaaa gttgcttatg gcttttggtt cagataaaga 60 
aaaagaacag cattttaatt tgtgaagatt agtctgagca gaatttcatt gtatctagaa 120 
agaaattgaa aaaagaaata ttctatttca ctattatgtt aggtgcaact atatcatcac 180 
catggaaaag ccggagtaaa aagagaacgt agaggagatt tcatgatttg attgagaata 24 0 
taatatatta tttttttgta attccacaca aagattaaga aaatgatctg atcaatgatg 300 
gctccgagga tttggctgtc gc.gggaacta tgacattaat ataaatttgt cgctgcctat 3 60 
aaagacccta tctatctatc tatctatcta tatatatata tatatatata ' tatatatata 420 
tatatatata tatatatata tatatatata tatatataag cgctaatatt tgattatttt 480 
ttaaaaatat ttataagtat atatgaaatt tttgacgaaa tttttgtgtg accgtgaccc 540 
ctcaacctat agtgtgcgtc cacctgtgcc aacaatatag agacaatttg ctcgtatagt 600 
cagaaagagt gttttacttt ttagttgctt tttagtgaat ctactcggta taaagttaaa 660 
ttagtgggtc aataagtcgg gtgaatagtt aaagaaaaca gtggtgagtt tagctgtcaa 720 
ataatttctt ctttttcttg ttttcacatt agaaatcaaa ataaaacaca agctttttgt 780 



atttatttta 
tatatgagga 
aagcaaattc 
aacttttacc 
agacagactt 
cgagtttttc 
ctcattatat 
tttatatata 
tactcttgtt 1 
ccgaacaact 
aatagaatag 
ttttaggatg 
aaaaatatcc 
cggtcaaact 
acatgctact 
ttccatcaag 
aagcactaac 
actacgagaa 
atggaatcct 
tatgccatgc 
ttggaagttt 
gtttctcaaa 
tatgtcttgg 
gggggccaaa 
aatggtgcat 
tggtaagttt 
ttcacaagac 
gtgttattat 
aacatttgtc 
caatagtgaa 
ataggattta 
atcttaatca 
tttctctttg 
tgcgcggggt 
gtgaatctaa 
aagcaaactt 
atattaaggc 
ttacggcgct 
ctcttaactt 
tacaactaat 
agtagagttt 
aggctgtttt 
aattcaaaac 
aatgaaagat 
atatattata 
aagtgtcaca 
gttcgaacct 
catatctgac 
aaggaggagt 
acccaaaatt 
aaaaaatact 
gtggtcttgg 
attttgattt 
cttattttgt 
tcgttctcaa 
agttgtatgc 
cacttgtgaa 



acacaagcta 
aagtacgaag 
gcacttgata 
aaaaaggcaa 
taacacacca 
cgcgacggat 
ttgtataatt 
agtgtatact 
aaattgtcca 
ccaaatgttc 
ataattgttg 
gaagctaatt 
ataaaaggtg 
ttcaaaagag 
gtgcaattga 
aaaaccaatt 
tcaactgtac 
taattacact 
caaccaaaag 
aactattgtc 
ttgagatatt 
ttcctaactg 
ctagttactc 
aaagagtgta 
caatggggcc 
tgtcctactg 
aagattctta 
tctccatctg 
cttctagaaa 
tatggttatt 
ttattaatgt 
gagtttgatg 
aatagacctt 
tcggggaaag 
attaatgaga 
tagtggatgg 
gtcgcagccg 
agaataatga 
aaatctatat 
gttttgagaa 
ggggaggata 
cgaaagactc 
ctatattaag 
ttatgtttga 
agtatctttt 
ttgttggaga 
gaaattttag 
tctctttcta 
tccatttgac 
caatgatgtt 
tgaaaattac 
agttaacctc 
gccacatgtt 
tgtttattat 
aaaagtttat 
tctgagtctt 
ttctattata 



attatatgtt 
aaaatgtgcc 
agtggctaag 
taatttgcaa 
actcccacag 
ctagaatttg 
atgggtttat 
ccacgtctcc 
ggctccaaac 
aatgttaggt 
tttcttatat 
tttaggatgg 
aaatttaatt 
aaagaaataa 
aaaataaaca 
atgacaattc 
atgattgtga 
acaactctta 
ccaaatacca 
atcttcagtc 
agccaaatct 
cacaaaacct 
cttgtttact 
tggtttgtca 
acttttggct 
tgtattcttt 
agttttatta 
atccttttat 
atcaggtatt 
aattagtgtt 
tgtcaaagat 
ttcgagccac 
acacaataaa 
gaccgcatca 
ctaaaaaatg 
cttggaaaga 
ttagatactt 
aatttctcta 
tataaaacta 
aacaaaataa 
gtgtgtacgg 
tcggctcaag 
tttataatcc' 
ctcctctaaa 
tttggtttcc 
tgggggcaac 
ttaaagataa 
atttcaaatt 
agggtacacg 
ttcaacaagg 
aaaggttttg 
aagatgatta 
gttcaacatg 
atttacttcg 
aaaggttttg 
gcgcatggta 
taaggagtaa 



tatatgctgg 
aattgtcgtg 
tccactttct 
ttcaaaaaga 
gaagcaacaa 
ggttcattct 
attttttatt 
ggatactaca 
gcatgttcgt 
gtgtttggtg 
agttttgaac 
agtacagcct 
agtaacatga 
ctagacaaac 
aaagagaacc 
ttaaccaaag 
agcctaacaa 
tagctaattc 
acacaatcag 
ctcccctttg 
aatgacacta 
gaagcaccta 
tgttccattg 
caagtgggaa 
ttgcttcaaa 
ttgcagtggc 
cttgtcgatt 
tggtcacttt 
attaattttt 
ttaaggaaga 
tctggtggat 
aggaatgaat 
aggacaaccc 
ccaggtctat 
gaacccaaca 
tctttcttct 
tcaagaagaa 
tatatataat 
tattaagtaa 
caacaacatc 
agaccttacc 
aacattaaaa 
atggtatatt 
cttgttttta 
cactgtggcc 
gctcacaaca 
agaagtactt 
acaggtttga 
gtgtgcatgc 
caatgatcaa 
agaaccttaa 
catctaaata 
ccccttccta 
atcatcaggt 
aactccatca 
tcatagtcaa 
gcctaccaaa 



ttaggtgaag 
tacagcaaag 
agtggaccta 
aaaaaggaaa 
tgcaactcac 
ttacgctttt 
tattgtaatt 
ttagcctcta 
ttcaatttta 
ttaagcttcc 
aatcgtcgcc 
aaggttaaaa 
aaagataaaa 
ttcaacaacc 
agacaatatt 
tcacaactaa 
aaacactcta 
ttgtctcaag 
aagaagagcg 
tgttgcattc 
aactttctgc 
ctatgttaaa 
ttgaagatga 
aattctttgt 
ataaagtatt 
tgtattgatt 
tatgttagtc 
acctaaaaat 
caattccatc 
tgtaaggata 
ggatcggaga 
ttgtttttaa 
ggtacactaa 
tgtacgcagc 
ccagtgaaaa 
tgaataactt 
agctaaaaaa 
tcaaaagtta 
cttctgccta 
aaacccaatg 
cctaccttat 
atttgagaaa 
atattggctt 
atgcaaaaga 
gctaaattcg 
aagacgattc 
accataatgg 
actaaaagat 
atttgaatat 
tcacacaact 
aactttggtt 
ccccacaatt 
tcctggtacc 
ctaggtctgt 
cctattgctt 
tttatttaag 
aaggagcgaa 



taaagcatgt 
cagccagcac 
gtggttcact 
aaagaaaact 
aaaaggaaac 
tcgtattaaa 
tttgtaaaat 
gggttcttaa 
acggatgttt 
gtcctaggtt 
ataaactaat 
tataactata 
ctagtgttat 
aacctgccca 
tcaaccaata 
cacttataaa 
aaaggaaaag 
attttcagct 
taactgcaca 
aacaattcaa 
ttctcaaatt 
taggatgctt 
aaaaaataat 
taaaaatgaa 
cataaacagc 
ggttgccttt 
gtatgtgcta 
attgttacaa 
tttattactc 
atttaatcaa 
aaatttcttc 
tagggagtat 
gcttccgtta 
gttacccaac 
ccaaaaaaag 
ggagcgctat 
tgttttaaag 
ataatttatt 
atttataata 
aaatcccaca 
aaagttaaag 
acaaaatata 
agtaatctga 
ggcacaacat 
gattcgctgg 
tataattagt 
tagatatgat 
gcagttcttg 
ccaaaatcgg 
gtagtcatga 
gatgttggag 
aagggcacta 
ttctctcgtt 
caagttaaat 
taggattttg 
ctcgttattg 
aatattttcc 



840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 



aaaactcttt ttaaaccttc ctcaccccat tcccctctcc cctctccccc aacaccaccc 4260 
accaccccaa ctcccccgtc ttagtttttt tatttatcct ggactttctt atattttatg 4320 
ctttccttta attgaactct tgtaactaaa ccatttgccc cccaccctat agtgtttgcc . 4380 
taaattttat atttttcaaa ataatatttt ctatttacta attaaacatt agaaaatatt 4440 
tttcggattt ttttccactc accaaccaag catgggaaaa tagtgataaa actactcatt 4500 
tttcaaaata atattttcaa ggaaaacatt ttcctttata ccaaataccc ttactcttgt 4560 
atacaaatct tcatgtcgat gatcttgcaa tatatataca tgtatatgta tgatttgata 4620 
aaccacatga acaaaatggt tgagctctgc gaattgtgat atatgatttg cttatgtgtt 4680 
gtgcactatc aattacttaa attaaacttc atctaataat attgcagggg tggaacatgt 4740 
tgggggagat atgtttgaaa gtgttccaga aggagatgct atttttatga agtggattct 4800 
tcatgactgg agtgatagtc acaacctcaa gttgctaaag aactgctaca aggctctacc 4860 
agacaatgga aaggtgattg ttgttgaggc cattttacca gtgaaaccag acattgacac 4 92 0 
cgcagtggtt ggcgtttcgc aatgtgattt gatcatgatg gctcaaaatc ctggaggcaa 4 980 
agagcgatcg gaagaggagt ttcgagcctt ggctactgaa gctggattca aaggcgttaa 5040 
cttaatatgt tgtgtctgta atttttgggt catggaattc tgcaagtaga tttctactgt 5100 
acattgagtt tctactactc ttgagtatcc atttatggca atctgggact ggaattgcag 5160 
cttagtccag attgaacatt gatattccta ataatatttc tattatttcc cttgtttatt 5220 
tctcttgtat gaaaggatgt cattttgagt attgataatc atgttctcta ggacagaaat 52 80 
tgtaactttg tccaacttta ttgatattcc tagtaagatt tatatgacat gtgtctctgg 5340 
tttgagaaga gtttcaatat ctacagacgg g 5371 
<210> 3 
<211> 1095 
<212> DNA 

<213> Nicotiana tabacum 

<220> 
<221> CDS 

<222> (1) . . . (1095) 
<400> 3 

atg gaa tec tea acc aaa age caa ata cca aca caa tea gaa gaa gag 48 
Met Glu Ser Ser Thr Lys Ser Gin lie Pro Thr Gin Ser Glu Glu Glu 
15 10 15 

cgt aac tgc aca tat gee atg caa eta ttg tea tct tea gtc etc ccc 96 
Arg Asn Cys Thr Tyr Ala Met Gin Leu Leu Ser Ser Ser Val Leu Pro 
20 25 30 

ttt gtg ttg cat tea aca att caa ttg gaa gtt ttt gag ata tta gee 144 
Phe Val Leu His Ser Thr lie Gin Leu Glu Val Phe Glu lie Leu Ala 
35 40 45 

aaa tct aat gac act aaa ctt tct get tct caa att gtt tct caa att 192 
Lys Ser Asn Asp Thr Lys Leu Ser Ala Ser Gin lie Val Ser Gin lie 
50 55 60 

cct aac tgc aca aaa cct gaa gca cct act atg tta aat agg atg ctt 240 
Pro Asn Cys Thr Lys Pro Glu Ala Pro Thr Met Leu Asn Arg Met Leu 
65 70 75 80 

tat gtc ttg get agt tac tec ttg ttt act tgt tec att gtt gaa gat 288 
Tyr Val Leu Ala Ser Tyr Ser Leu Phe Thr Cys Ser lie Val Glu Asp 
85 90 95 



gaa aaa aat aat ggg ggc caa aaa aga gtg tat ggt ttg tea caa gtg 33 6 
Glu Lys Asn Asn Gly Gly Gin Lys Arg Val Tyr Gly Leu Ser Gin Val 
100 105 110 



gga aaa ttc ttt gtt aaa aat gaa aat ggt gca tea atg ggg cca ctt 384 
Gly Lys Phe Phe Val Lys Asn Glu Asn Gly Ala Ser Met Gly Pro Leu 
115 120 125 

ttg get ttg ctt caa aat aaa gta ttc ata aac age tgg ttt gaa eta 432 
Leu Ala Leu Leu Gin Asn Lys Val Phe lie Asn Ser Trp Phe Glu Leu 
130 135 140 

aaa gat gca gtt ctt gaa gga gga gtt cca ttt gac agg gta cac ggt 480 
Lys Asp Ala Val Leu Glu Gly Gly Val Pro Phe Asp Arg Val His Gly 
145 , 150 155 160 

gtg cat gca ttt gaa tat cca aaa teg gac cca aaa ttc aat gat gtt 528 
Val His Ala Phe Glu Tyr Pro Lys Ser Asp Pro Lys Phe Asn Asp Val 
165 170 175 

ttc aac aag gca atg ate aat cac aca act gta gtc atg aaa aaa ata 576 
Phe Asn Lys Ala Met lie Asn His Thr Thr Val Val Met Lys Lys lie 
180 185 190 

ctt gaa aat tac aaa ggt ttt gag aac ctt aaa act ttg gtt gat gtt 624 
Leu Glu Asn Tyr Lys Gly Phe Glu Asn Leu Lys Thr Leu Val Asp Val 
195 200 205 

99 a 99t ggt ctt gga gtt aac etc aag atg att aca tct aaa tac ccc 672 
Gly Gly Gly Leu Gly Val Asn Leu Lys Met lie Thr Ser Lys Tyr Pro 
210 215 220 

aca att, aag ggc act aat ttt gat ttg cca cat gtt gtt caa cat gec 720 
Thr lie Lys Gly Thr Asn Phe Asp Leu Pro His Val Val Gin His Ala 
225 230 235 240 

cct tec tat cct ggg gtg gaa cat gtt ggg gga gat atg ttt gaa agt 768 
Pro Ser Tyr Pro Gly Val Glu His Val Gly Gly Asp Met Phe Glu Ser 
245 250 255 

gtt cca gaa gga gat get att ttt atg aag tgg att ctt cat gac tgg 816 
Val Pro Glu Gly Asp Ala lie Phe Met Lys Trp lie Leu His Asp Trp 
260 265 270 

agt gat agt cac aac etc aag ttg eta aag aac tgc tac aag get eta 864 
Ser Asp Ser His Asn Leu Lys Leu Leu Lys Asn Cys Tyr Lys Ala Leu 
275 280 285 

cca gac aat gga aag gtg att gtt gtt gag gee att tta cca gtg aaa 912 
Pro Asp Asn Gly Lys Val lie Val Val Glu Ala lie Leu Pro Val Lys 
290 295 300 

cca gac att gac ace gca gtg gtt ggc gtt teg caa tgt gat ttg ate 960 
Pro Asp lie Asp Thr Ala Val Val Gly Val Ser Gin Cys Asp Leu lie 
305 310 315 320 

atg atg get caa aat cct gga ggc aaa gag cga teg gaa gag gag ttt 1008 
Met Met Ala Gin Asn Pro Gly Gly Lys Glu Arg Ser Glu Glu Glu Phe 
325 330 335 



cga gcc ttg get act gaa get gga ttc aaa ggc gtt aac tta ata tgt 
Arg Ala Leu Ala Thr Glu Ala Gly Phe Lys Gly Val Asn Leu lie Cys 
340 345 350 



1056 



tgt gtc tgt aat ttt tgg gtc atg gaa ttc tgc aag tag 
Cys Val Cys Asn Phe Trp Val Met Glu Phe Cys Lys * 
355 360 



1095 



<210> 4 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide No. 1 
<400> 4 

cgtttcgcaa tgtgatttga tc 22 

<210> 5 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide No. 2 



<210> 6 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide No. 3 
<400> 6 

ctgaagatgt caatagttgc atggc 2 5 

<210> 7 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide No. PAS1 



<400> 5 

ctcaaaatga catcctttca tac 



23 



<400> 7 

ggtctagagg gecttttaga gtgtttttgt tag 



33 



<210> 8 

<211> 29 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Synthetic Oligonucleotide No. PS1 



<400> 8 

aaagtcgacc gtccacctgt gccaacaat 2 9 

<210> 9 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide No. PS2 
<400> 9 

tgtttggtgt tatgcttccg tcct 24 

<210> 10 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide No. PS3 
<400> 10 

aaaaagcttt tttaggatgg agtacagcc 29 

<210> 11 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide No. PS4 
<400> 11 

tttaagctta aagagaacca gacaatatt 2 9 

<210> 12 
<211> 354 
<212> DNA 

<213> Phytophthora megasperma 

<220> 

<221> CDS 

<222> (1) . . . (354) 

<221> CDS 

<222> (1) . . . (60) 

<223> Preprotein 

<400> 12 

atg aac ttc acc get ctg etc get gec gtc gec gec gec ttg gtc gga 48 

Met Asn Phe Thr Ala Leu Leu Ala Ala Val Ala Ala Ala Leu Val Gly 
15 10 15 



tct gcc aac gcc acc gcg tgc acc gcc acc cag cag acc get gcg tac 96 
Ser Ala Asn Ala Thr Ala Cys Thr Ala Thr Gin Gin Thr Ala Ala Tyr 
20 25 30 



aag aca etc gtg age ate ctg teg gac gcg teg ttc aac aag tgc tct 144 
Lys Thr Leu Val Ser lie Leu Ser Asp Ala Ser Phe Asn Lys Cys Ser 
35 40 45 

acg gat teg ggc tac tec atg ctg acg gcc aag gcc etc ccc acc acg 192 
Thr Asp Ser Gly Tyr Ser Met Leu Thr Ala Lys Ala Leu Pro Thr Thr 
50 55 60 

gcg cag tac aag etc atg tgc gcg tec acg gca tgc aac acc atg ate 240 
Ala Gin Tyr Lys Leu Met Cys Ala Ser Thr Ala Cys Asn Thr Met lie 
65 70 75 80 

aag aag ate gtg acg ctg aac ccg ccc aac tgc gac ctg acg gtg ccc 288 
Lys Lys lie Val Thr Leu Asn Pro Pro Asn Cys Asp Leu Thr Val Pro 
85 90 95 

acg age ggc ctg gtg etc aac gtg tac teg tac gcg aac ggc ttc teg 336 
Thr Ser Gly Leu Val Leu Asn Val Tyr Ser Tyr Ala Asn Gly Phe Ser 
100 105 110 

gac aag tgc teg teg ctg 354 
Asp Lys Cys Ser Ser Leu 
115 



<210> 13 
<211> 294 
<212> DNA 

<213> Phytophthora megasperma 

<220> 

<221> CDS 

<222> (1) . . . (294) 

<400> 13 

acc gcg tgc acc gcc acc cag cag acc get gcg tac aag aca etc gtg 4 8 

Thr Ala Cys Thr Ala Thr Gin Gin Thr Ala Ala Tyr Lys Thr Leu Val 
1 5 10.. 15 

age ate ctg teg gac gcg teg ttc aac aag tgc tct acg gat teg ggc 96 
Ser lie Leu Ser Asp Ala Ser Phe Asn Lys Cys Ser Thr Asp Ser Gly 
20 25 30 

tac tec atg ctg acg gcc aag gcc etc ccc acc acg gcg cag tac aag 144 
Tyr Ser Met Leu Thr Ala Lys Ala Leu Pro Thr Thr Ala Gin Tyr Lys 
35 40 45 

etc atg tgc gcg tec acg gca tgc aac acc atg ate aag aag ate gtg 192 
Leu Met Cys Ala Ser Thr Ala Cys Asn Thr Met lie Lys Lys lie Val 
50 55 60 



acg ctg aac ccg ccc aac tgc gac ctg acg gtg ccc acg age ggc ctg 240 



Thr Leu Asn Pro Pro Asn Cys Asp Leu Thr Val Pro Thr Ser Gly Leu 
65 70 75 80 



gtg etc aac gtg tac teg tac gcg aac ggc ttc teg gac aag tgc teg 
Val Leu Asn Val Tyr Ser Tyr Ala Asn Gly Phe Ser Asp Lys Cys Ser 
85 90 95 



288 



teg ctg 
Ser Leu 



294 



<210> 14 

<211> 1620 

<212> DNA 

<213> Artificial Sequence 
<220> 

<221> promoter 

<222> (1) . . . (1263) 

<223> COMTII promoter 

<221> CDS 

<222> (1264) . . . (1620) 

<223> CDS megaspermine 

<223> Synthetic construct 



<400> 14 

cgtccacctg tgccaacaat atagagacaa tttgctcgta 
ctttttagtt gctttttagt gaatctactc ggtataaagt 
tcgggtgaat agttaaagaa aacagtggtg agtttagctg 
cttgttttca cattagaaat caaaataaaa cacaagcttt 
gctaattata tgtttatatg ctggttaggt gaagtaaagc 
gaagaaaatg tgccaattgt cgtgtacagc aaagcageca 
gataagtggc taagtccact ttctagtgga cctagtggtt 
gcaataattt gcaattcaaa aagaaaaaag gaaaaaagaa 
accaactccc acaggaagca acaatgeaac tcacaaaagg 
ggatctagaa tttgggttca ttctttaege tttttegtat 
aattatgggt ttatattttt tatttattgt aatttttgta 
tactccacgt ctccggatac tacattagee tctagggttc 
tccaggctcc aaacgcatgt tcgtttcaat tttaaeggat 
gttcaatgtt aggtgtgttt ggtgttaagc ttccgtccta 
gttgtttctt atatagtttt gaacaatcgt cgccataaac 
aatttttagg atggagtaca gectaaggtt aaaatataac 
ggtgaaattt aattagtaac atgaaaagat aaaactagtg 
agagaaagaa ataactagac aaacttcaac aaccaacctg 
ttgaaaaata aacaaaagag aaccagacaa tatttcaacc 
aattatgaca attcttaacc aaagtcacaa ctaacactta 
gtacatgatt gtgaagecta acaaaaacac tctaaaaggc 
acc atg aac ttc ace get ctg etc get gee gtc 
Met Asn Phe Thr Ala Leu Leu Ala Ala Val 
15 10 



tagtcagaaa gagtgtttta 60 
taaattagtg ggtcaataag 120 
tcaaataatt tcttcttttt 180 
ttgtatttat tttaacacaa 240 
atgttatatg aggaaagtac 300 
gcacaagcaa attegcaett 360 
cactaacttt taccaaaaag 420 
aactagacag actttaacac 480 
aaaccgagtt tttccgcgac 540 
taaactcatt atatttgtat 600 
aaattttata tataagtgta 660 
ttaatactct tgttaaattg 720 
gtttccgaac aactccaaat . 780 
ggttaataga atagataatt 840 
taatttttag gatggaagct 900 
tataaaaaat atccataaaa 960 
ttateggtea aactttcaaa 1020 
cccaacatgc tactgtgcaa 1080 
aatattccat caagaaaacc 1140 
taaaaagcac taactcaact 1200 
ctctagagga tccccggggt 1260 
gee gee gee ttg gtc 1308 
Ala Ala Ala Leu Val 
15 



gga tct gee aac gee acc gcg tgc acc gee acc 
Gly Ser Ala Asn Ala Thr Ala Cys Thr Ala Thr 
20 25 



cag caa acc get gcg 
Gin Gin Thr Ala Ala 



30 



1356 



tac aaa aca etc gtg age ate ctg teg gac gcg teg ttc aac aag tgc 1404 
Tyr Lys Thr Leu Val Ser lie Leu Ser Asp Ala Ser Phe Asn Lys Cys 
35 40 45 

tct acg gat teg ggc tac tec atg ctg acg gee aag gee etc ccc acc 1452 
Ser Thr Asp Ser Gly Tyr Ser Met Leu Thr Ala Lys Ala Leu Pro Thr 
50 55 60 

acg gcg cag tac aag etc atg tgc gcg tec acg gca tgc aac acc atg 1500 
Thr Ala Gin Tyr Lys Leu Met Cys Ala Ser Thr Ala Cys Asn Thr Met 
65 70 75 

ate aaa aaa ate gtg acg ctg aac ccg ccc aac tgc aac ctg acg gtg 1548 
lie Lys Lys lie Val Thr Leu Asn Pro Pro Asn Cys Asn Leu Thr Val 
80 85 90 95 

ccc acg age ggc ctg gtg etc aac gtg tac teg tac cca aac ggc ttc 1596 
Pro Thr Ser Gly Leu Val Leu Asn Val Tyr Ser Tyr Pro Asn Gly Phe 
100 105 110 

teg gac aag tgc teg teg ctg taa 1620 
Ser Asp Lys Cys Ser Ser Leu * 
115 



<210> 15 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide PAS 2 
<400> 15 

cgcggatccc cttttagagt gtttttgtta ggc 33 

<210> 16 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide PS5 
<400> 16 

acgcgtcgac gttagggaca atctatagtg tcac 34 

<210> 17 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide PS6 



<400> 17 



acgcgtcgac gctccgagga tttggctgtc gcgg 

<210> 18 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide PS7 
<400> 18 

acgcgtcgac gctggttagg tgaagtaaag catg 

<210> 19 
<211> 33 
<212> DNA 

<213> Artificial Sequence 

<220> f 

<223> Synthetic Oligonucleotide PS8 

<400> 19 

acgcgtcgac gcatgttata tgaggaaagt acg 

<210> 20 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide PS9 
<400> 20 

acgcgtcgac gcagccagca caagcaaatt cgc 

<210> 21 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide PS10 
<400> 21 

acgcgtcgac gactttaaca caccaactcc c 

<210> 22 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide PS11 
<400> 22 

acgcgtcgac cggatctaga atttgggttc attc 



<211> 35 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide PS12 

<400> 23 

acgcgtcgac gtgtatactc cacgtctccg gatac 

<210> 24 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide PS13 
<400> 24 

acgcgtcgac gttcaatgtt aggtgtgttt gg 

<210> 25 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide PAS 3 
<400> 25 

cgcggatccg cttaacacca aacacaccta acattg 

<210> 26 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide PS14 
<400> 26 

acgcgtcgac cagtggtgag tttagctgtc 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

( i ) APPLICANT : 

(A) NAME: RHOBIO 

(B) STREET: 14-20 Rue Pierre BAIZET 

(C) CITY: LYONS 

(E) COUNTRY: France 

(F) POSTAL CODE: 69009 

(ii) TITLE OF INVENTION: Inducible COMT II 
promoter, chimeric gene comprising it and transformed 
plants 



(iii) NUMBER OF SEQUENCES: 26 



(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, #1.30 
(EPO) 



(2) INFORMATION FOR SEQ ID NO: 1: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1863 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME /KEY : misc_signal 

(B) LOCATION: 667.. 672 

(D) OTHER INFORMATION: /function = "inverted 
W box" 

(ix) FEATURE: 

(A) NAME/KEY: misc_signal 

(B) LOCATION: 820.. 830 

(D) OTHER INFORMATION: /function = "inverted 
L box" 

(ix) FEATURE: 

(A) NAME/KEY: enhancer 

(B) LOCATION: 845.. 852 

(ix) FEATURE: 
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(A) NAME /KEY: misc_signal 

(B) LOCATION: 1034.. 1047 

(D) OTHER INFORMATION: /function = "P box" 



(ix) FEATURE: 

(A) NAME/KEY: misc_signal 

(B) LOCATION: 1221.. 1226 

(D) OTHER INFORMATION: /function = "G box" 

(ix) FEATURE: 

(A) NAME/KEY: misc_signal 

(B) LOCATION: 1343.. 1356 

(D) OTHER INFORMATION: /function = "inverted 
L box" 



(ix) FEATURE: 

(A) NAME/KEY: misc_signal 
■•(B) LOCATION: 1369.. 1374 
(D) OTHER INFORMATION: /function = "A box" 

(ix) FEATURE: 

(A) NAME/KEY: misc_signal 

(B) LOCATION: 1377.. 1382 

(D) OTHER INFORMATION: /function = "GT box" 

(ix) FEATURE: 

(A) NAME/KEY: misc_signal 

(B) LOCATION: 1483.. 1488 

(D) OTHER INFORMATION: /function = "GT box"- 



(ix) FEATURE: 

(A) NAME / KEY : misc_signal 

(B) LOCATION: 1562.. 1567 

(D) OTHER INFORMATION: /function = "inverted 
W box" 

(ix) FEATURE: 

(A) NAME / KEY : misc_signal 

(B) LOCATION: 1600.. 1614 

(D) OTHER INFORMATION: /function = "L box" 

(ix) FEATURE: 

(A) NAME /KEY: CAAT_signal 

(B) LOCATION: 1675.. 1679 

(ix) FEATURE: 

(A) NAME/KEY: misc_signal 

(B) LOCATION: 1681.. 1690 

(D) OTHER INFORMATION: /function = "E box" 



(ix) FEATURE: 
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(A) NAME /KEY: CAAT_signal 
(,B) LOCATION: 1695.. 1699 

(ix) FEATURE: 

(A) NAME /KEY: TATA_signal 

(B) LOCATION: 1735 . . 1739 

(ix) FEATURE: 

(A) NAME/KEY: transcription origin 

(B) LOCATION: 1772 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1: 

AAAGTTAGGG ACAATCTATA GTGTCACAAA GTTGCTTATG GCTTTTGGTT CAGATAAAGA 60 

AAAAGAACAG CATTTTAATT TGTGAAGATT AGTCTGAGCA GAATTTCATT GTATCTAGAA 120 

AGAAATTGAA AAAAGAAATA TTCTATTTCA CTATTATGTT AGGTGCAACT ATATCATCAC 180 

CATGGAAAAG CCGGAGTAAA. AAGAGAACGT AGAGGAGATT TCATGATTTG ATTGAGAATA 240 

TAATATATTA TTTTTTTGTA ATTCCACACA AAGATTAAGA AAATGATCTG ATCAATGATG 300 

GCTCCGAGGA TTTGGCTGTC GCGGGAACTA TGACATTAAT ATAAATTTGT CGCTGCCTAT 360 

AAAGACCCTA TCTATCTATC TATCTATCTA TATATATATA TATATATATA TATATATATA 420 

TATA TATA TA TATATATATA TATATATATA TATATATAAG CGCTAATATT TGATTATTTT 480 

TTAAAAATAT TTATAAGTAT ATATGAAATT TTTGACGAAA TTTTTGTGTG ACCGTGACCC 540 

CTCAACCTAT AGTGTGCGTC CACCTGTGCC AACAATATAG AGACAATTTG CTCGTATAGT 600 
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CAGAAAGAGT GTTTTACTTT TTAGTTGCTT TTTAGTGAAT CTACTCGGTA TAAAGTTAAA 660 

TTAGTGGGTC AATAAGTCGG GTGAATAGTT AAAGAAAACA GTGGTGAGTT TAGCTGTCAA 7 20 

ATAATTTCTT CTTTTTCTTG TTTTCACATT AGAAATCAAA ATAAAACACA AGCTTTTTGT 780 

ATTTATTTTA ACACAAGCTA ATTATATGTT TATATGCTGG TTAGGTGAAG TAAAGCATGT 840 

TATATGAGGA AAGTACGAAG AAAATGTGCC AATTGTCGTG TACAGCAAAG CAGCCAGCAC 900 

AAGCAAATTC GCACTTGATA AGTGGCTAAG TCCACTTTCT AGTGGACCTA GTGGTTCACT 960 

AACTTTTACC* AAAAAGGCAA TAATTTGCAA TTCAAAAAGA AAAAAGGAAA AAAGAAAACT 1020 

AGACAGACTT TAACACACCA ACTCCCACAG GAAGCAACAA TGCAACTCAC AAAAGGAAAC 1080 

CGAGTTTTTC CGCGACGGAT CTAGAATTTG GGTTCATTCT TTACGCTTTT TCGTATTAAA 1140 

CTCATTATAT TTGTATAATT ATGGGTTTAT ATTTTTTATT TATTGTAATT TTTGTAAAAT 1200 

TTTATATATA AGTGTATACT CCACGTCTCC GGATACTACA TTAGCCTCTA GGGTTCTTAA 1260 

TACTCTTGTT AAATTGTCCA GGCTCCAMC GCATGTTCGT TTCAATTTTA ACGGATGTTT 1320 

CCGAACAACT CCAAATGTTC AATGTTAGGT GTGTT'I'GGTG TTAAGCTTCC GTCCTAGGTT 1380 

AATAGAATAG ATAATTGTTG TTTCTTATAT AGTTTTGAAC AATCGTCGCC ATAAACTAAT 14 40 

TTTTAGGATG GAAGCTAATT TTTAGGATGG AGTACAGCCT AAGGTTAAAA TATAACTATA 1500 

AAAAATATCC ATAAAAGGTG AAATTTAATT AGTAACATGA AAAGATAAAA CTAGTGTTAT 1560 

CGGTCAAACT TTCAAAAGAG AAAGAAATAA CTAGACAAAC TTCAACAACC AACCTGCCCA 1620 

ACATGCTACT GTGCAATTGA AAAATAAACA AAAGAGAACC AGACAATATT TCAACCAATA 168 0 

TTCCATCAAG AAAACCAATT ATGACAATTC TTAACCAAAG TCACAACTAA CACTTATAAA 174 0 

AAGCACTAAC TCAACTGTAC ATGATTGTGA AGCCTAACAA AAACACTCTA AAAGGAAAAG 1800 

ACTACGAGAA TAATTACACT ACAACTCTTA TAGCTAATTC TTGTCTCAAG ATTTTCAGCT IB 60 

ATG 1863 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5371 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME /KEY: promoter 

(B) LOCATION: 1. .1860 

(ix) FEATURE: 

(A) NAME/KEY: transcription origin 

(B) LOCATION: 1772 

(ix) FEATURE: 

(A) NAME /KEY: exon 

(B) LOCATION: 1861.. 2281 
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(ix) FEATURE: 

(A) NAME /KEY: intron 

(B) LOCATION: 2282.. 3633 

(ix) FEATURE: 

(A) NAME /KEY: exon 

(B) LOCATION: 3634.. 3944 

(ix) FEATURE: 

(A) NAME /KEY: intron 

(B) LOCATION: 3945.. 4726 

(ix) FEATURE: 

(A) NAME /KEY: exon 

(B) LOCATION: 4727.. 5089 

(ix) FEATURE: 

(A) NAME/KEY: terminator 

(B) LOCATION: 5090.. 5371 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2: 

AAAGTTAGGG ACAATCTATA GTGTCACAAA GTTGCTTATG GCTTTTGGTT CAGATAAAGA 60 

AAAAGAACAG CATTTTAATT TGTGAAGATT AGTCTGAGCA GAATTTCATT GTATCTAGAA 120 

AGAAATTGAA AAAAGAAATA TTCTATTTCA CTATTATGTT AGGTGCAACT AT AT CATC AC 180 

CATGGAAAAG CCGGAGTAAA AAGAGAACGT AGAGGAGATT TCATGATTTG ATTGAGAATA 24 0 

TAATATATTA TTTTTTTGTh ATTCCACACA AAGATTAAGA AAATGATCTG ATCAATGATG 300 

GCTCCGAGGA TTTGGCTGTC GCGGGAACTA TGACATTAAT ATAAATTTGT CGCTGCCTAT .360 

AAAGACCCTA TCTATCTATC TATCTATCTA TATATATATA TATATATATA TAT AT AT AT A 4 20 

TATATATATA TATATATATA TATATATATA TATATATAAG CGCTAATATT TGATTATTTT 4 80 

TTAAAAATAT TTATAAGTAT ATATGAAATT TTTGACGAAA TTTTTGTGTG ACCGTGACCC 54 0 

CTCAACCTAT AGTGTGCGTC CACCTGTCCC AACAATATAG AGACAATTTG CTCGTATAGT 600 

CAGAAAGAGT GTTTTACTTT TTAGTTGCTT TTTAGTGAAT CTACTCGGTA TAAAGTTAAA 660 

TTAGTGGGTC AATAAGTCGG GTGAATAGTT AAAGAAAACA GTGGTGAGTT TAGCTGTCAA 720 

ATAATTTCTT CTTTTTCTTG TTTTCACATT AGAAATCAAA ATAAAACACA AGCTTTTTGT 780 

ATTTATTTTA ACACAAGCTA ATTATATGTT TATATGCTGG TTAGGTGAAG TAAAGCATGT 840 

TATATGAGGA AAGTACGAAG AAAATGTGCC AATTGTCGTG TACAGCAAAG CAGCCAGCAC 900 

AAGCAAATTC GCACTTGATA AGTGGCTAAG TCCACTTTCT AGTGGACCTA GTGGTTCACT 960 

AACTTTTACC AAAAAGGCAA TAATTTGCAA TTCAAAAAGA AAAAAGGAAA AAAGAAAACT 1020 

AGACAGACTT TAACACACCA ACTCCCACAG GAAGCAACAA TGCAACTCAC AAAAGGAAAC 1080 

CGAGTTTTTC CGCGACGGAT CTAGAATTTG GGTTCATTCT TTACGCTTTT TCGTATTAAA 1140 

CTCATTATAT TTGTATAATT ATGGGTTTAT ATTTTTTATT TATTGTAATT TTTGTAAAAT 1200 

TTTATATATA AGTGTATACT CCACGTCTCC GGATACTACA TTAGCCTCTA GGGTTCTTAA 1260 
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TACTCTTGTT 


AAATTGTCCA 


GGCTCCAAAC 


GCATGTTCGT 


7TCAATTTTA 


ACGGATGTTT 


1320 


CCGAACAACT 


CCAAATGTTC 


AATGTTAGGT 


GTGT7TGGTG 


TTAAGCTTCC 


G7CC7AGGTT 


1380 


AA7AGAATAG 


ATAATTGTTG 


TTTCTTATAT 


AGTTTTGAAC 


AA7CGTCGCC 


ATAAACTAAT 


1440 


TTTTAGGATG 


GAAGCTAATT 


TTTAGGATGG 


AGTACAGCCT 


AAGGTTAAAA 


TATAACTATA 


1500 


AAAAATATCC 


ATAAAAGGTG 


AAATTTAATT 


AGTAACA7GA 


AAAGATAAAA 


CTAGTGTTAT 


1560 


CGGTCAAACT 


TTCAAAAGAG 


AAAGAAATAA 


CTAGACAAAC 


TTCAACAACC 


*AACC7GCCCA 


1620 


ACATGCTACT 


GTGCAATTGA 


AAAATAAACA 


AAAGAGAACC 


AGACAATATT 


TCAACCAATA 


1680 


TTCCATCAAG 


AAAACCAATT 


ATGACAATTC 


T7AACCAAAG 


TCACAACTAA 


CACTTATAAA 


1740 


AAGCACTAAC 


TCAACTGTAC 


ATGATTG7GA 


AGCCTAACAA 


AAACACTCTA 


AAAGGAAAAG 


1800 


ACTACGAGAA 


TAATTACACT 


ACAACTCTTA 


TAGCTAATTC 


T7G7CTCAAG 


A7TT7CAGCT 


1860 


ATGGAATCCT 


CAACCAAAAG 


CCAAATACCA 


ACACAATCAG 


AAGAAGAGCG 


TAACTGCACA 


1920 


TATGCCATGC 


AACTATTGTC 


ATCTTCAGTC 


CTCCCCT7TG 


TGTTGCATTC 


AACAATTCAA 


1980 


TTGGAAGTTT 


TTGAGATATT 


AGCCAAATCT 


AA7GACAC7A 


AACTTTCTGC 


TTCTCAAATT 


2040 


GTTTCTCAAA 


TTCCTAACTG 


CACAAAACCT 


GAAGCACCTA 


CTATGTTAAA 


TAGGATGCTT 


2100 


TATGTCTTGG 


CTAGTTACTC 


CTTGTTTACT 


TGTTCCATTG 


TTGAAGATGA 


AAAAAATAAT 


2160 


GGGGGCCAAA 


AAAGAGTGTA 


TGGTTTGTCA 


CAAG7GGGAA 


AATTCTTTGT 


TAAAAATGAA 


2220 


AATGGTGCAT 


CAATGGGGCC 


ACTTTTGGCT 


TTGCT7CAAA 


ATAAAGTATT 


CATAAACAGC 


2280 


TGGTAAGTTT 


TGTCCTACTG 


TGTATTCTTT 


TTGCACTGGC 


TGTATTGATT 


GGTTGCCTTT 


2340 


TTCACAAGAC 


AAGATTCTTA 


AGTTTTATTA 


CTTGTCGATT 


TATGTTAGTC 


GTATGTGCTA 


2400 


GTGTTATTAT 


TCTCCATC7G 


ATCCTTTTAT 


7GGTCAC7T7 


ACCTAAAAA7 


A7TG7TACAA 


2460 


AACATTTGTC 


CTTCTAGAAA 


ATCAGGTATT 


ATTAAT7TTT 


CAATTCCATC 


TTTATTACTC 


2520 


CAATAGTGAA 


TATGGTTATT 


AATTAGTGTT 


TTAAGGAAGA 


TGTAAGGATA 


ATTTAA7CAA 


2580 


ATAGGATTTA 


TTATTAATGT 


TGTCAAAGAT 


TC7GGTGGAT 


GGATCGGAGA 


AAAT7TCTTC 


2640 


ATCTTAATCA 


GAGTTTGATG 


TTCGAGCCAC 


AGGAA7GAA7 


TTGT7TTTAA 


TAGGGAGTAT 


2700 


TTTCTCTTTG 


AATAGACCT7 


ACACAATAAA 


AGGACAACCC 


GGTACACTAA 


GCTTCCGTTA 


2760 


TGCGCGGGGT 


TCGGGGAAAG 


GACCGCATCA 


CCAGG7CTAT 


TG7ACGCAGC 


GTTACCCAAC 


2820 


GTGAATCTAA 


ATTAATGAGA 


CTAAAAAATG 


GAACCCAACA 


CCAGTGAAAA 


CCAAAAAAAG 


2880 


AAGCAAACTT 


TAGTGGATGG 


CTTGGAAAGA 


TCTTTCTTCT 


TGAATAACTT 


GGAGCGCTAT 


2940 


ATATTAAGGC 


GTCGCAGCCG 


TTAGATACTT 


7CAAGAAGAA 


AGCTAAAAAA 


TGTTTTAAAG 


3000 


-TTACGGCGCT 


AGAATAATGA 


AATTTCTCTA 


TATA7A7AAT 


TCAAAAGTTA 


ATAATTTATT 


3060 


CTCTTAACTT 


AAATCTATAT 


TATAAAAC7A 


TAT7AAGTAA 


CTTC7GCCTA 


ATTTATAATA 


3120 


TACAACTAAT 


GTTTTGAGAA 


AACAAAATAA 


CAACAACATC 


AAACCCAATG 


AAATCCCACA 


3180 


AGTAGAGTTT 


GGGGAGGATA 


GTGTGTACGG 


AGACCTTACC 


CCTACCTTA7 


AAAGT7AAAG 


3240 
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AGGCTGTTTT 


CGAAAGACTC 


TCGGCTCAAG 


AACATTAAAA ATTTGAGAAA ACAAAATATA 


3300 


AATTCAAAAC 


CTATATTAAG 


TTTATAATCC 


ATGGTATATT 


ATATTGGCTT 


AGTAATCTGA 


3360 


AATGAAAGAT 


TTATGTTTGA 


CTCCTCTAAA 


CTTGTTTTTA 


ATGCAAAAGA 


GGCACAACAT 


3420 


ATATATTATA 


AGTATCTTTT 


TTTGG7TTCC 


CACTGTGGCC 


GCTAAATTCG 


GATTCGCTGG 


3480 


AAGTGTCACA 


TTGTTGGAGA 


TGGGGGCAAC 


GCTCACAACA 


AAGACGATTC 


TATAATTAGT 


3540 


G7TCGAACCT 


GAAATTTTAG 


TTAAAGATAA 


AGAAGTACTT 


ACCATAATGG 


TAGATATGAT 


3600 


CATATCTGAC 


TCTCTTTCTA 


ATTTCAAATT 


ACA'GGTTTGA 


ACTAAAAGAT 


GCAGTTCTTG 


3660 


AAGGAGGAGT 


TCCA7TTGAC 


AGGGTACAC'G 


GTGTGCATGC 


ATTTGAATAT 


CCAAAATCGG 


3720 


ACCCAAAATT 


CAATGATGTT 


TTCAACAAGG 


CAATGATCAA 


TCACACAACT 


GTAGTCATGA 


3780 


AAAAAATACT 


TGAAAATTAC 


AAAGGTTTTG 


AGAACCTTAA 


AACTTTGGTT 


GATGTTGGAG 


3840 


GTGGTCTTGG AGTTAACCTC AAGATGATTA CATCTAAATA CCCCACAATT AAGGGCACTA 


3900 


ATTTTGATTT 


GCCACATGTT 


GTTCAACATG 


CCCCTTCCTA 


TCCTGGTACC 


TTCTCTCGTT 


3960 


CTTATTTTGT 


TGTTTATTAT 


ATTTACTTCG 


ATCATCAGGT 


CTAGGTCTGT 


CAAGTTAAAT 


4020 


TCGTTCTCAA 


AAAAGTTTAT 


AAAGGTTTTG 


AACTCCATCA 


CCTATTGCTT 


TAGGATTTTG 


4080 


AGTTGTATGC 


TCTGAGTCTT 


GCGCATGGTA 


TCATAGTCAA 


TTTATTTAAG 


CTCGTTATTG 


4140 


CACTTGTGAA 


TTCTATTATA 


TAAGGAGTAA 


GCCTACCAAA 


AAGGAGCGAA 


AATATTTTCC 


4200 


AAAACTCTTT 


TTAAACCTTC 


CTCACCCCAT 


7CCCCTCTCC 


CCTCTCCCCC 


AACACCACCC 


4260 


ACCACCCCAA 


CTCCCCCGTC 


TTAGTTTTTT 


TATTTATCCT 


GGACTTTCTT 


ATATTTTATG 


4320 


CTTTCCTTTA 


ATTGAACTCT 


TGTAACTAAA 


CCATTTGCCC 


CCCACCCTAT 


AGTGTTTGCC 


4380 


TAAAT7TTAT 


ATTTTTCAAA 


ATAATATTTT 


CTATTTACTA 


ATTAAACATT 


AGAAAATATT 


4 440 


TTTCGGATTT 


TTTTCCACTC 


ACCAACCAAG 


CATCGGAAAA 


TAGTGATAAA 


ACTACTCATT 


4500 


TTTCAAAATA 


ATATTTTCAA 


GGAAAACATT 


TTCCTTTATA 


CCAAATACCC 


TTACTCTTGT 


4560 


ATACAAATCT 


TCATGTCGAT 


GATCTTGCAA 


TAT AT AT AC A 


TGTATATGTA 


TGATTTGATA 


4620 


AACCACATGA 


ACAAAATGGT 


TGAGCTCTGC 


GAATTGTGAT 


ATATGATTTG 


CTTATGTGTT 


4680 


GTGCACTATC 


AATTACTTAA 


ATTAAACTTC 


ATCTAATAAT 


ATTGCAGGGG 


TGGAACATGT 


4740 


TGGGGGAGAT 


ATGTTTGAAA 


GTGTTCCAGA 


AGGAGATGCT 


ATTTTTATGA 


AGTGGATTCT 


4800 


TCATGACTGG 


AGTGATAGTC 


ACAACCTCAA 


GTTGCTAAAG 


AACTGCTACA 


AGGCTCTACC 


4860 


AGACAATGGA 


AAGGTGATTG 


TTGTTGAGGC 


CATTTTACCA 


GTGAAACCAG 


ACATTGACAC 


4920 


CGCAGTGGTT 


GGCGTTTCGC 


AATGTGATTT 


GATCATGATG 


GCTCAAAATC 


CTGGAGGCAA 


4980 


ACA'GCGATCG 


GAAGAGGAGT 


TTCGAGCCTT 


GGCTACTGAA 


GCTGGATTCA 


AAGGCGTTAA 


5040 


CTTAATATGT 


TGTGTCTGTA 


ATTTTTGGGT 


CATGGAATTC 


TGCAAGTAGA 


TTTCTACTGT 


5100 


ACATTGAGTT 


TCTACTACTC 


TTGAGTATCC 


ATTTATGGCA 


ATCTGGGACT 


GGAATTGCAG 


5160 


CTTAGTCCAG 


ATTGAACATT 


GATATTCCTA 


ATAATATTTC 


TATTATTTCC 


CTTGTTTATT 


5220 



TCTCTTGTAT GAAAGGATGT CATTTTGAGT ATTGATAATC ATGTTCTCTA GGACAGAAAT 
TGTAACTTTG TCCAACTTTA TTGATATTCC TAGTAAGA7T TATATGACAT GTGTCTCTGG 
TTTGAGAAGA GTTTCAATAT CTACAGACGG G 



5280 
5340 
5371 
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( 2 ) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1095 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1. .1095 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



ATG 
Met 
1 


GAA 

Glu 


TCC 
Ser 


TCA 
Ser 


ACC 
Thr 
c 


AAA 
Lys 


AGC 
Ser 


CAA 
Gin 


ATA 
He 


CCA 
Pro 
10 


ACA 
Thr 


CAA 
Gin 


TCA 
Ser 


GAA 
Glu 


GAA 
Glu 
15 


■ GAG 
Glu 


48 


CGT 
Arg 


AAC 
Asn 


TGC 
cys 


ACA 

J Mi 

20 


TAT 
Tyr 


GCC 

Aid 


ATG 
Met 


CAA 
bin 


CTA 
Leu 
25 


TTG 
Leu 


TCA 


TCT 

C o -r- 

oer 


TCA 
i>e r 


GTC 
vai 
30 


CTC 
Leu 


CCC 
fro 


96 


TTT 
Phe 


GTG 
Val 


TTG 
Leu 
35 


CAT 
His 


TCA 
Ser 


ACA 
Thr 


ATT 
He 


CAA 
Gin 
40 


TTG 
Leu 


GAA 
Glu 


GTT 
Val 


TTT 
Phe 


GAG 
Glu 
45 


ATA 
He 


TTA 
Leu 


GCC 
Ala 


144 


AAA 
Lys 


TCT 
Ser 
50 


AAT 
Asn 


GAC 
Asp 


ACT 
Thr 


AAA 
Lys 


CTT 
Leu 
55 


TCT 
Ser 


GCT 
Ala 


TCT 
Ser 


CAA 
Gin 

j 


ATT 
He 
60 


GTT 
Val 


TCT 
Ser 


CAA 
Gin 


ATT 
He 


192 


CCT 
Pro 
6S 


AAC 
Asn 


TGC 
Cys 


ACA 
Thr 


AAA 
Lys 


CCT 
Pro 
70 


GAA 
Glu 


GCA 
Ala 


CCT 
Pro 


ACT 
Thr 


ATG 
Met 

75 


TTA 
Leu 


AAT 
Asn 


AGG 
Arg 


ATG 
Met 


CTT 
Leu 
80 


240 


TAT GTC 
Tyr Val 


TTG 
Leu 


GCT 
Ala 


AGT 
Ser 
85 


TAC 
Tyr 


TCC 
Ser 


TTG 
Leu 


TTT 
Phe 


ACT 
Thr 
90 


TGT 

Cys 


TCC 
Ser 


ATT 
lie 


GTT 
Val 


GAA 
Glu 
95 


GAT 
Asp 


288 


GAA 
Glu 


AAA 
Lys 


AAT 
Asn 


AAT 
Asn 
100 


GGG 
Gly 


GGC 
Gly 


CAA 
Gin 


AAA 
Lys 


AGA 
Arg 
105 


GTG 
Val 


TAT 
Tyr 


GGT 
Gly 


TTG 
Leu 


TCA 
Ser 
110 


CAA 
Gin 


GTG 
Val 


336 


GGA AAA 
Gly Lys 


TTC 
Phe 
115 


TTT 
Phe 


GTT 
Val 


AAA 

Lys 


AAT 
Asn 


GAA 
Glu 
12C 


AAT 
Asn 


GGT 
Gly 


GCA 
Ala 


TCA 
Ser 


ATG 
Met 
125 


GGG 
Gly 


CCA 
Pro 


CTT 
Leu 


384 


TTG 
Leu 


GCT 
Ala 
130 


TTG 
Leu 


CTT 
Leu 


CAA 
Gin 


AAT 
Asn 


AAA 
Lys 
135 


GTA 
Val 


T^C 
Phe 


ATA 
lie 


AAC 
Asn 


AGC 
Ser 
140 


TGG 
Trp 


TTT 
Phe 


GAA 
Glu 


CTA 
Leu 


432 


AAA 
Lys 
145 


GAT 
Asp 


GCA 
Ala 


GTT 
Val 


CTT 
Leu 


GAA 
Glu 
150 


GGA 
Gly 


GGA 
Gly 


GTT 
Val 


CCA 
Pro 


TTT 
Phe 
155 


GAC 
Asp 


AGG 
Arg 


GTA 
Val 


CAC 
His 


GGT 
Gly 
160 


480 


GTG 
Val 


CAT 
His 


GCA 
Ala 


TTT 
Phe 


GAA 
Glu 
165 


TAT 
Tyr 


CCA 
Pro 


AAA 
Lys 


TCG 
Ser 


GAC 
Asd 
170 


CCA 
Pro 


AAA 
Lys 


TTC 
Phe 


AAT 
Asn 


GAT 
Asp 
175 


GTT 
Val 


528 
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TTC AAC AAG GCA ATG ATC AAT CAC ACA ACT GTA GTC ATG AAA AAA ATA 57 6 

Phe Asn Lys Ala Met lie Asn His Thr Thr Val Val Met Lys Lys lie 
180 185 190 

CTT GAA AAT TAC AAA GGT TTT GAG AAC CTT AAA ACT TTG GTT GAT GTT 624 
Leu Glu Asn Tvr Lys Gly Phe Glu Asn Leu Lys Thr Leu Val Asp Val 
195 ' 20C 205 

GGA GGT GGT CTT GGA GTT AAC CTC AAG ATC ATT ACA TCT AAA TAC CCC 67 2 

Gly Gly Gly Leu Gly Val Asn Leu I.ys Met He Thr Ser Lys Tyr Pro 
210 215 220 

ACA ATT AAG GGC ACT AAT TTT GAT TTG CCA CAT GTT GTT CAA CAT GCC 720 
Thr He Lys Gly Thr Asn Phe Asp Leu Pro His Val Val Gin His Ala 

225 230 235 240 

CCT TCC TAT CCT GGG GTG GAA CAT GTT GGG GGA GAT ATG TTT GAA AGT 7 68 

Pro Ser Tyr Pro Gly Val Glu His Val Gly Gly Asp Met Phe Glu Ser 
245 250 255 

GTT CCA GAA GGA GAT GCT ATT TTT ATG AAG TGG ATT CTT CAT GAC TGG 816 
Val Pro Glu Gly Asp Ala He Phe Met Lys Trp He Leu His Asp Trp 
260 265 270 

AGT GAT AGT CAC AAC CTC AAG TTG CTA AAG AAC TGC TAC AAG GCT CTA 0 64 

Ser Asp Ser His Asn Leu Lys Leu Leu Lys Asn Cys Tyr Lys Ala Leu 
275 280 285 

CCA GAC AAT GGA AAG GTG ATT GTT GTT GAG GCC ATT TTA CCA GTG AAA 912 
Pro Asp Asn Gly Lys Val lie Val Val Glu Ala He Leu Pro Val Lys 
290 295 300 

CCA GAC ATT GAC ACC GCA GTG GTT GGC GTT TCG CAA TGT GAT TTG ATC 960 
Pro Asp He Asp Thr Ala Val Val Gly Val Ser Gin Cys Asp Leu He 
305 310 315 320 

ATG ATG GCT CAA AAT CCT GGA GGC AAA GAG CGA TCG GAA GAG GAG TTT* 1008 
Met Met Ala Gin Asn Pro Gly Gly Lvs Glu Arg Ser Glu Glu Glu Phe 
325 330 335 

CGA GCC TTG GCT ACT GAA GCT GGA TTC AAA GGC GTT AAC TTA ATA TGT 1056 
Arg Ala Leu Ala Thr Glu Ala Gly Phe Lys Gly Val Asn Leu He Cys 
340 345 350 

TGT GTC TGT AAT TTT TGG GTC ATG GAA TTC TGC AAG TAG 1095 
Cys Val Cys Asn Phe Trp Val Met Glu Phe Cys Lys 
355 360 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Synthetic oligonucleotide No.l 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

CGTTTCGCAA TGTGATTTGA TC 22 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: Synthetic oligonucleotide No. 2 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

CTCAAAATGA CATCCTTTCA TAC 23 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Synthetic oligonucleotide No. 3 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6: 

CTGAAGATGT CAATAGTTGC ATGGC 2S 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Synthetic oligonucleotide PAS1 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7: 

GGTCTAGAGG GCCTTTTAGA GTGTTTTTGT TAG 33 



(2) INFORMATION FOR SEQ ID NO: 8: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Synthetic oligonucleotide PS1 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

AAAGTCGACC GTCCACCTGT GCCAACAAT 29 



(2) INFORMATION FOR SEQ ID NO: 9: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Synthetic oligonucleotide PS2 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

TGTTTGGTGT TATGCTTCCG TCCT 24 

(2) INFORMATION FOR SEQ ID NO : 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 92 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Synthetic oligonucleotide PS3 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

AAAAAGCTTT TTTAGGATGG AGTACAGCC 29 

(2) INFORMATION FOR SEQ ID NO : 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Synthetic oligonucleotide PS4 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

TTTAAGCTTA AAGAGAACCA GACAATATT 29 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 354 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA 
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(ix) FEATURE: 

. (A) NAME /KEY : CDS 
(B) LOCATION: 1..60 

(D) OTHER INFORMATION: /function = preprotein 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 61.. 60 

(D) OTHER INFORMATION: /function = preprotein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

atg aac ttc acc get ctg etc get gec gtc gee gec gec ttg gtc gga AS 
Met Asn Pho Thr Ala Leu Leu Ala Ala Val Ala Ala Ala Leu Val Gly 
15 10 15 

tct gec aac gec acc gec tgc acc gec acc cag cag acc get gcg tac 96 
Ser Ala Asn Ala Thr Ala Cys Thr Ala Thr Gin Gin Thr Ala Ala Tyr 
20 25 30 

aag aca etc gtg age ate ctg teg gac gcq teg ttc aac aag tgc tct 144 
Lys Thr Leu Val Ser lie Leu Ser Asp Ala Ser Phe Asn Lys Cys Ser 
35 40 45 

acg gat teg ggc tac tec atg ctg acg gec aag gec etc ccc acc acg 192 
Thr Asp Ser Gly Tyr Ser Met Leu Thr Ala Lys Ala Leu Pro Thr Thr 
50 55 60 



gcg cag tac aag etc atg tgc gcg tec acg gca tgc aac acc atg ate 240 
Ala Gin Tyr Lys Leu Met Cys Ala Ser Thr Ala Cys Asn Thr Met lie 
65 70 "75 80 

aag aag ate gtg acg ctg aac ccg ccc aac tgc gac ctg acg gtg ccc 288 
Lys Lys lie Val Thr Leu Asn Pro Pro Asn Cvs Asp Leu Thr Val Pro 
85 90 95 

acg age ggc ctg gcg etc aac gtg tac teg tac gcg aac ggc ttc teg 336 
Thr Ser Gly Leu Val Leu Asn Val Tyr Ser Tyr Ala Asn Gly Phe Ser 
-100 105 110 

gac aag tgc teg teg etc 354 
Asp Lys Cys Ser Ser Leu 
115 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 354 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1. .294 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
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acc gcg tgc acc gcc acc cag cag acc get gcg tac aag aca etc gtg 48 

Thr Ala Cys Thr Ala Thr Gin Gin Thr Ala Ala Tyr Lys Thr Leu Val 

15 10 15 

age ate ctg teg gac gcg teg ttc aac aag tgc tct acg gat teg ggc 96 

Ser lie Leu Ser Asp Ala Ser Phe Asn Lys Cys Ser Thr Asp Ser Gly 

20 25 30 



tac tec atg ctg acg gcc aag gcc etc ccc acc acg gcg cag tac aag 144 
Tyr Ser Met Leu Thr Ala Lys Ala Leu Pro Thr Thr Ala Gin Tyr Lys 
35 40 45 



/ 



etc atg tgc gcg tec acg gca tgc aac acc atg ate aag aag ate gtg 192 
Leu Met Cys Ala Ser Thr Ala Cys Asn Thr Met lie Lys Lys lie Val 
50 55 60 

acg ctg aac ccg ccc aac tgc gac ctg acg gtg ccc acg age ggc ctg 240 
Thr Leu Asn Pro Pro Asn Cys Asp Leu Thr Val Pro Thr Ser Gly Leu 
65 70 1 75 80 

ctg etc aac gtg tac teg tac gcg aac ggc ttc teg gac aag tgc teg 288 
Val Leu Asn Val Tyr Ser Tyr Ala Asn Gly Phe Ser Asp Lys Cys Ser 
85 90 95 

teg ctg 294 
Ser Leu 



(2) INFORMATION FOR SEQ ID NO: 14: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1620 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA 



FEATURE : 

(A) NAME / KEY : 

(B) LOCATION: 



COMT II promoter 
1. .1263 



FEATURE : 

(A) NAME /KEY: CDS megaspermine 

(B) LOCATION: 1264.. 1630 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
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cgtccacctg tgccaacaat atagagacaa tttgctcgta tagtcagaaa gagtgtttta 60 

ctttttagtt gctttttagt gaatctactc ggtataaagt taaattagtg ggtcaataag 120 

tcgggtgaat agttaaagaa aacagtggtg agtttagctg tcaaataatt tcttcttttt 180 

cttgttttca cattagaaat caaaataaaa cacaagcttt ttgtatttat tttaacacaa 240 

gctaattata tgtttatatg ctggttaggt gaagtaaagc atgttatatg aggaaagtac 300 

gaagaaaatg tgccaattgt cgtgtacagc aaagcagcca gcacaagcaa attcgcactt 360 

gataagtggc taagtccact ttctagtgga cctagtggtt cactaacttt taccaaaaag 420 

gcaataattt gcaattcaaa aagaaaaaaq gaaaaaagaa aactagacag actttaacac 480 

accaactccc acaggaagca acaatgcaac tcacaaaagg aaaccgagtt tttccgcgac 540 

ggatctagaa tttgggttca ttctttacgc ttr.ttcgtat taaactcatt atatttgtat 600 

aattatgggt ttatattttt tatttattgt aatttttgta aaattttata tataagtgta 660 

tactccacgt ctccggatac tacattagcc tctagggttc ttaatactct tgttaaattg 720 

tccaggctcc aaacgcatgc tcgtttcaat tttaacggat gtttccgaac aactccaaat 780 

gttcaatgtt aggtgtgttt ggtgttaagc ttccgtccta ggttaataga atagataatt 840 

gttgtttctt acatagtttt gaacaatcgt cgccataaac taatttttag gatggaagct 900 

aatttttagg atggagtaca gcctaaggtt aaaatataac tataaaaaat atccataaaa 960 

ggtgaaattt aattagtaac atgaaaagat aaaactagtg ttatcggtca aactttcaaa 1020 

agagaaagaa ataactagac aaacttcaac aaccaacctg cccaacatgc tactgtgcaa 1080 

ttgaaaaata aacaaaagag aaccagacaa tatttcaacc aatattccat caagaaaacc 1140 

aattatgaca attcttaacc aaagtcacaa ctaacactta taaaaagcac taactcaact 1200 

gtacatgatt gtgaagccta acaaaaacac tctaaaaggc ctctagagga tccccggggt 1260 

acc atg aac ttc acc get ctg etc get gec gtc gec gec gec ttg gtc 1308 
Met Asn Phe Thr A] a Leu Leu A] a Ala Val Ala Ala Ala Leu Val 
1 5 10 15, 

gga tct gec aac gec acc gcg tgc acc gec acc cag caa acc get gcg 1356 
Gly Ser Ala Asn Ala Thr Ala Cys Thr Ala Thr Gin Gin Thr Ala Ala 
20 -25 30 

tac aaa aca etc gtg age ate ctg teg gac gcg teg ttc aac aag tgc 1404 
Tyr Lys Thr Leu Val Ser lie Leu Ser Asp Ala Ser Phe Asn Lys Cys 
35 40 45 

tct acg gat teg ggc tac tec atg ctg acg gee aag gee etc ccc acc 1452 
Ser Thr Asp Ser Gly Tyr Ser Met Leu Thr Ala Lys Ala Leu Pro Thr 
50 55 60 

acg gcg cag tac aag etc atg tgc gcg tec acg gca tgc aac acc atg 1500 
Thr Ala Gin Tyr Lys Leu Met Cys Ala Ser Thr Ala Cys Asn Thr Met 
65 70 75 

ate aaa aaa ate gtg acg ctg aac ccg ccc aac tgc aac ctg acg gtg 1548 
lie Lys Lys lie Val Thr Leu Asn Pro Pro Asn Cys Asn Leu Thr Val 
80 85 90 95 

ccc acg age ggc ctg gtg etc aac gtg tac teg tac cca aac ggc ttc 1596 
Pro Thr Ser Gly Leu Val Leu Asn Val Tyr Ser Tyr Pro Asn Gly Phe 
100 105 110 

teg gac aag tgc teg teg ctg taa 1620 
Ser Asp Lys Cys Ser Ser Leu 
115 



WO 00/56897 



15 



PCT/FROO/ 00714 



(2) INFORMATION FOR SEQ ID NO : 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Synthetic oligonucleotide PAS 2 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 15: 

CGCGGATCCC CTTTTAGAGT GTTTTTGTTA GGC 33 

(2) INFORMATION FOR SEQ ID NO : 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Synthetic oligonucleotide PS5 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

ACGCGTCGAC GTTAGGGACA ATCTATAGTG TCAC 3 3** 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Synthetic oligonucleotide PS 6 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

ACGCGTCGAC GCTCCGAGGA TTTGGCTGTC GCGG 34 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Synthetic oligonucleotide PS7 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

ACGCGTCGAC GCTGGTTAGG TGAAGTAAAG CATG ■ 34 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

, (A) LENGTH: 33 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Synthetic oligonucleotide PS 8 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

ACGCGTCGAC GCATGTTATA TGAGGAAAGT ACG 33 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : > 3 3 base pairs 

(B) TYPE: nucleotide 

(C) * STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Synthetic oligonucleotide PS9 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

ACGCGTCGAC GCAGCCAGCA CAAGCAAATT CGC 33 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Synthetic oligonucleotide PS10 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

ACGCGTCGAC GACTTTAACA CACCAACTCC C 31 
(2) INFORMATION FOR SEQ ID NO: 22: 



f 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Synthetic oligonucleotide PS11 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

ACGCGTCGAC CGGATCTAGA ATTTGGGTTC ATTC 3* 
(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) ' MOLECULE TYPE: Synthetic oligonucleotide PS12 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

ACGCGTCGAC GTGTATACTC CACGTCTCCG GATAC 3S 

(2) INFORMATION FOR SEQ ID NO : 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Synthetic oligonucleotide PS13 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

ACGCGTCGAC GTTCAATGTT AGGTGTGTTT GG 32 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Synthetic oligonucleotide PAS 3 



} 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

CGCGGATCCG CTTAACACCA AACACACCTA ACATTG 36 

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Synthetic oligonucleotide PS14 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 26: 

ACGCGTCGAC CAGTGGTGAG TTTAGCTGTC 
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